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discussed  in  this  report  disambiguates  references  from  discourse  and  interprets  the  purpose  of 
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disambiguate  references  with  minimal  search.  Focus  and  a frames  representation  of  the  discourse 
make  it  possible  to  interpret  discourse  purposes.  The  focus  and  representation  of  the  discourse  are 
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OlKOurM  and  Co-rafaranc*  in  PAL 


1 


: 


1.  Mraduclian 

Evary  discoursa  in  English  consists  of  ona  or  more.santancas  wNch  craata  a genarai 
contaxt  of  paopia,  placas,  objacts,  times  and  actions.  The  spaaKar  of  tha  discourse  ganarally 
will  not  relate  raferancas  from  ona  santanca  to  tha  previous  in  vny  direct  fashion  nor  indicate 
how  the  requests  or  assertions  of  each  santanca  In  the  discourse  are  connected.  For  tha 
fo  interpret  the  speaker’s  discourse  and  dacide  what  tha  speaker  is.  requesting  or 
assarting,  the  hearer  must  complete  two  tasks,  among  others:  (1)  disambiguate  tha  refarantial 
terms  for  their  intar-sentential  and  axtra-sanlantiai  links,  and  (2)  determine  the  purpose  of 
each  senteiKe  in  the  discourse.  The  first  of  these  two  tasks  makes  it  possible  to  know  what 
entities  the  speaker  is  referring  to.  The  second  task  results  in  establishing  a connected 
discourse  and  understanding  what  the  speaker  wants  to  communicate.  Interpreting  tha 
discourse  purposes  of  various  sentences  explains  why  D1  is  acceptable  below  (even  though 
Dl-2  does  not  mention  the  party)  while  D2  Is  unacceptable.  A theory  of  reference 
disambiguation  will  explain  the.  disambiguation  of  Ns  to  Bruce  and  not  to  Mike,  in  03. 

01 - 1  John  is  having  a party  at  his  house. 

2 I think  the  guest  of  honor  is  Mary  as  they  are  going  to  announce  the  publication 
of  Mary’s  book. 

02- 1  Henry  wants  to  meet  with  Harold. 

2 Sing  a song  before  3 on  Thursday. 

03- 1  I want  to  have  a meeting  this  week. 

2 BrucSj  will  be  the  guest  lecturer. 

3 He  will  speak  on  slavery  in  ant  colonies. 

4 Mike  wants  to  read  h|^  report  before  the  talk. 

An  explanation  of  fhese  phenomena  underlies  the  research  being  conducted  at  the  MIT  AI  lab 
on  PAL.  While  PAL  is  designed  to  understand  the  English  form  of  requests  for  arranging 
various  events,  the  design  depends  upon  a theory  about  how  to  interpret  a speaker’s^ 
extended  discourse.  PAL  acts  as  a modet  of  e hearer  in  these  discourse  situations.  Two 
problems  that  must  be  solved  before  PAL  can  understand  requests  in  extertded  discourse  are 
referential  disambiguation  and  discourse  purpose  interpretation.  TNs  paper  reports  on 
'progress  on  these  two  problems. 

A sample  scenario  of  what  PAL  is  designed  to  do  is  given  in  D4  below. 

04- 1  I want  to  schedule  a meeting  with  Dave. 

2 It  should  be  at  3 p.m.  on  Thursday. 

3 We  can  meet  in  Ns  office. 

4 Invite  Bruce. 


i 'I 


2 


r 

i 


1.  I will  use  the  term  speaker  to  refer  to  the  producer  of  a spoken  or  written  discourse  and 
heerer  to  refer  to  the  receiver  of  the  discourse. 
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Discourse  end  Co*re(er#rKe  in  PAL 


2.  Definition  of  Disceurse 

First,  a "discourse"  must  t>e  defined.  I tsKe  a discourse  to  be  any  connected  piece 
of  text  or  spoken  language  of  more  than  one  sentence  or  independent  sentence  fragment. 
Ideally,  every  discourse  is  about  some  central  concept  which  is  then  elaborated  by  the  clauses 
of  a discourse.  Speakers  often  produce  discourses  which  fail  to  meet  this  specification 
because  they  talk  la)  about  several  concepts  without  relating  them  or  lb)  without  informing 
the  hearer  that  several  concepts  will  be  discussed  at  or>ce  or  2)  because  there  is  no  central 
concept  in  their  discourses.  However,  this  idealization  will  serve  to  introduce  some  important 
terms.  Multi-concept  discourses  do  occur,  and  can  be  described  usirtg  an  approach  which  is  a 
germralized  version  of  that  presenlad  in  tNs  paper.  Some  cases  of  mulli-concept  discourse 
are  discussed  in  Buliwinkle  [1977].  However,  the  theory  presented  here  has  been  tested  in  a 
running  implementation  of  PAL,  and  this  paper  is  restricted  to  that  tested  model. 

In  previous  work  [Winograd,  1971;  Rieger,  1973;  Cherniak,  1972]  various  structures 

for  referencing  were  assumed.  Winograd  used  lists  of  entities  of  the  same  semantic  type  and 

chose  referents  for  anaphoric  terms  based  on  recency  and  likelihood  in  the  proper  .semantic 

class.  His  mechanism  was  loo  simple  and  failed  to  account  for  numerous  anaphoric  cases  as 

* 

well  as  being  limited  to  objects  in  a closed  world.  Rieger  postulated  memory  structures  from  a 
conceptual  dependency  representation  of  the  sentences  of  a discourse.  The  men«ory 
structures  were  used  to  infer  other  information  that  could  be  urdfied  to  determirw 
co-reference.  His  algorithms  suffer  from  the  explosive  number  of  infererKOS  that  can  be  made 
from  each  memory  structure.  Charnlak  supposed  that  there  were  large  collections  of 
inference  rules,  called  demons,  which  knew  what  to  do  with  a small  piece  of  the  total 
knowledge,  and  which  fired  whenever  that  knowledge  was  encountered.  TWs  theory 
represents  overkill;  if  one  could  have  as  many  demons  as  Cherniak  supposed  and  get  them  to 
fire  when  that  knowledge  occurred,  the  mechanism  could  be  used  to  predict  co-referentiality 
of  referencial  terms.  However,  controlling  the  multitude  of  demons  is  difficult^  , and 
furthermore  one  cannot  imagine  how  such  a collection  of  knowledge  is  learrmd  in  the  first 
place. 

To  interpret  definite  noun  phrases  and  anaphora,  a different  approach  is  taken  in 
PAL.  It  is  assumed  that  discourse  contains  a structura,  which  when  represented,  can  constrain 
the  interpretation  of  referential  terms.  From  the  discourse  structure,  ri^s  have  been 
discovered  which  govern  the  acceptability  of  referential  terms  in  different  discourse 
situations.  The  interpretation  of  references  is  not  strictly  deterministic;  it  is  like  knowing 


2.  Rosenberg  [personal  communcation]  has  created  a device  called  sentirmis  which  may 
partially  solve  this  problem. 
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~ 5 • DitcouTM  and  Co-rafaranca  in  PAL 


which  of  tavaral  placat  to  looK  in  tha  ditcourta  for  a co-rafarant  and  trying  out  tha  tarm 
found  thara. 

Tha  thaory  undartying  PAL  distinguiahaa  two  Kinda  of  rafarring.  Tha  firat  ia  an 
intarnal  rataranca  batwaan  a noun  phraaa  and  aoma  pra-axialing  databasa  objact.  That 
databaaa  objact  rapraanta  a raal  world  antity.  In  Flgura  I balow  Intarnal  rataranca  llnka  tha 
noun  phraaa  NPl  *Jmmy  Cartar”  to  a rapraaantation  of  Jmmy  Cartar  (who  ia  daacribad  aa 
praaidant  of  tha  US.  ate.).  How  that  databaaa  objact  rafara  to  tha  raal  world  ia  tha  claaaical 
aamanlic  problam  of  rataranca  (cf.  Kripka  [1972]  among  othara)  and  ia  bayond  tha  acopa  ?l 
tNa  work.  Tha  othar  Kind  of  rafarring  ia  eo-rafaran^.  Co-rafaranca  linKa  a noun  phraaa  to 
anolhar  noun  phraaa.  Tha  two  noun  phraaaa  ara  aaid  to  co-rafar,  and  both  ratar  to  tha  aama 
databaaa  objact.  In  Figura  1,  tha  daahad  link  from  NP2  *Jmmy"  to  NPl  ia  a co-rafaranca  link. 
Tha  dot-daah  link  from  NP2  to  tha  databaaa  objact  ia  a virtual  intarnal  rataranca  link  which 
reautta  from  tha  co-rafaranca  link  from  NP2  to  ffPl  and  from  tha  intarnal  rataranca  lirtk  from 
NPl  to  tha  databaaa  objact.  Intarnal  rataranca  and  co-rafaranca  lir«ka  ara  diatlnguiahad 
bacauaa  co-rafaranca  linka  can  ba  aatabliahad  mora  aaaily  uaing  diacouraa  atructura.  In  tha 
ramaindar  of  tNa  papar  whan  I apaak  of  Intarnal  rataranca,  I will  drop  tha  phraaa  "intarnal" 
and  uaa  oNy  "rataranca." 


Fig.  1 . Rataranca  Linka  Batwaan  Noun  Ptwaaaa 


co-rafaranca  ^ 


NPl  "Jmmy  Cartar* 


intarnal 
rataranca  I 


NP2  "Jmmy' 


virtual  intarnal  rataranca 


Oatabaca  Ropratantalion  of  Jmmy  Cartar 
Nama:  Jmmy  Cartar 
occupation:  Pratidant  of  U$ 
Nrthplaca:  Gaorgia 
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OiscouTM  and  Co-refarence  in  PAL 


3.  Tha  ConcapI  of  Fecw 

Tha  central  concept  of  a discourse  may  be  alaboratad  by  savaral  santancas  of  tha 
discourse  and  than  either  discontinued  In  favor  of  a related  concapti  or  dropped  in  favor  of  a 
new  concept.  This  central  concept  of  a discourse  is  called  the  discourse  focus  or  simply  the 
focus.  This  term  was  first  used  by  Geos*  [1977J.  A simple  example  of  focus  is  meeting  In  04 
repeated  below: 

04-1  I want  to  schedule  a meeting  with  Dave. 

2 It  should  be  at  3 p.m.  on  Thursday. 

3 We  can  meet  in  his  office. 

'4  Invite  Bruce. 

All  four  sentences  give  information  about  the  focussed  entity.  The  focus  is  what  makes  a text 
or  a set  of  utterances  a discourse. 

In  this  work  the  focus  is  assumed  to  be  a concept  to  which  other  concepts  are 
associated.  Some  of  the  association  links  ara  "built-in"  in  the  sense  that  they  exist  previous 
to  the  discourse.  For  example  with  meeting,  built-in  association  links  include  that  a meeting 
has  a time,  a place,  a set  of  participants,  and  a topic  of  discussion.  These  association  links  are 
distinguished  in  the  sense  that  the  concept  has  explicit  links  to  these  concepts  while  no 
explicit  links  exist  to  other  concepts  such  as  color,  cost  or  aga.  The  discourse  often  serves 
the  purpose  of  specifying  more  about  the  concepts  linked  to  a focus.  In  04-1,  thare  is  certain 
information  about  who  the  participants  are,  while  04-2  specifies  the  time.  04-3  causes  the 
hearer  to  infer  that  the  office  is  a place  for  a meeting,  because  the  focus  meetir^  has  a place 
associated  with  it,  artd  because  PAL  expects  to  be  intormed  about  the  concepts  associated  to  a 
meeting. 

In  PAL  the  association  links  between  concepts  are  easily  expressed  in  the  frames 
structure  of  FRL  [Goldstein  and  Roberts,  1977).  A frame  for  a meeting  has  slots  for  times, 
places,  participants  and  so  on.  it  is  exactly  these  slots  that  serve  the  purpose  of  association 
links  to  other  concepts.  One  purpose  of  a discourse  with  PAL  is  to  fill  those  slots  with  values 
and  required  information.  As  I will  discuss  in  tha  section  on  the  use  of  definite  noun  phrases, 
the  values  given  to  those  slots  are  also  useful  in  interpreting  co-reference  and  in 
ur>der standing  the  purpose  of  a sentence  of  the  discourse. 

Focus  also  serves  as  the  central  index  point  for  co-referencing.  The  focus  is  what 
is  going  to  be  talked  about  in  the  discourse.  When  it  Is  introduced,  it  is  new  information. 
Thereafter  it  is  the  given  information,  and  more  new  information  is  added  to  it.  Knowirtg  what 
the  focus  is  helps  determine  co-reference  relations  because  old  Information  can  be 
pronominalized  while  new  information  cannot.  If  a focus  is  seen  not  just  as  an  entity  by  itself 
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but  conn«ct«d  to  othor  ontitios,  tocut  indicalo$  how  thOM  ontifiot  con  bo  co-rofor|ntt  o*  woll. 
In  04-(2-4).  the  foeut  of  moohnf  con  bo  uood  to  dotormino  tho  co-roforonco  of  it,  wo  and  N» 
of  Ns  oWco:  it  mutt  co-rofor  to  tho  focut,  wo  to  thoto  indivkfualt  ottocialod  to  tho  foeot 
who  includo  tho  tpoohor,  and  Ns  to  an  individual  attociatod  to  tho  focut  who  it  not  tho 
tpoahor  and  hat  maio  fondor.  Tho  foeut  It  utod  at  on  accott  function  for  rotrioving  tho 
co-roforoni  of  a particular  noun  phrato.  Lator  in  tNt  papor,  rulot  govorning  tho  uto  of 
anaphora  by  moant  of  tho  focut  of  tho  ditcourto  will  bo  ditcuttod 

In  tho  curroni  vortion  of  PAL,  foeut  it  choton  at  tho  firtt  noun  phrato  following  tho 
vorb  if  ono  oxittt,  oito  tho  tubjoct  it  utod  at  focut.  Thit  mothod  of  chooting  focut  it 
adequate  for  current  PAL  ditcourtot  but  not  tufficiont  for  tho  mott  general  cate.  See  Sidnor 
[forthcoming]  for  a full  ditcutsion  of  focut  choice.  Once  a focut  it  choton,  It  can  bo  utod  in 
succeeding  sontoncos  to  dotormino  tho  co-roforonco  of  pronount  or  dofinito  noun  phratot  at 
woll  os  to  chocK  to  too  If  tho  ditcourto  it  ttill  connected.  A tontonco  like  (la)  below  followed 
by  (lb)  it  a disconnected  discourse  bocauto  tho  co-roforontial  terms  in  (lb)  ore  unrelated  to 
tho  focut  of  (la)  bated  on  tho  attociation  links  present  in  tho  database. 

(la)  1 want  to  moot  with  Henry. 

( lb)  Qivo  mo  an  ice  cream  cone. 

Tho  focut  of  tho  ditcourto  can  bo  changed  wNlo  maintaining  a connected  ditcourto. 
Tho  chief  moans  are  end  of  discourse  remarks  and  focut'Shift.  End  of  ditcourto  remarks  can 
bo  explicitly  slated  ones  like  "Thai's  all,"  or  Implicit  Ones,  such  at  the  act  of  simply  ending  tho 
input  stream.  A lost  reliable,  implicit  marking  of  tho  and  of  ditcourto  it  to  uto  a tontonco 
with  unrelated  co-roforonlial  terms.  In  tho  cate  above,  (la)  followed  by  (lb)  could  bo 
assumed  to  bo  two  separate  ditcourtot.  Thit  cate  it  lost  reliable  because  it  it  impottiblo  to 
toll  If  tho  speaker  attumet  that  the  ice  cream  cone  it  related  (at  it  often  the  cate  with  a 
rw>n-ldeal  speaker)  or  whether  the  speaker  Intends  to  change  tho  ditcourto  to  a now  ono.  At 
present  PAL  does  not  accept  tNs  kind  of  abrupt  ditcourto  changoi  iiwtpad  PAL  indicates  that 
such  a tontonco  it  not  inlolligiblo  in  the  ditcourto.  A more  topNsticatod  PAL  might  request 
that  tho.  speaker  explain  how  It  it  that  (lb)  it  related  to  the  ditcourto. 

* 

Tho  othor  moant  of  changing  tho  focut  1 call  focut-sNft.  A ditcourto  may  expand 
various  aspects  of  a focut  and  then  choose  one  aspect  of  tho  focut  to  dotcribo  in  detail.  For 
example,  in  a cHtcourte  about  meetingt,  we  may  want  to  tpend  teveral  tentoncet  specifying 
the  time  fpr  tho  mooting,  why  that  time  it  best  and  to  on.  When  tinw  it  being  ditcuttod,  orw 
would  like  to  know  that  tho  focut  hat  changed  to  that  attortiont  or  roquottt  con  bo  taken  to 
bo  about  time.  However,  tho  mooting  focut  may  bo  brought  back  Into  tho  ditcutsion  lotor.  To 
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casa 

frama 

intarpratar 


mappint 

procaaaor 


ctwafaranca 

procaaaor 


diacourae 

intarpratar 


achaduia201 

achaduia 

"imparativa* 

PAL 

maating203 


a-Kind-of 

typo 

actor 

event 


frame 


meelint203 

meetint 

offi€e207 


a-Kind-of 

place 

determiner 


office207 

office 

my209 


a-Kind-of 

determiner 


frame  - my209 

a-Kind-of  « my 

Given  theae  framea,  PAL  ia  expected  to  determine  what  my209,  office207,  and  meetir)g203 
co-refer  to.  PAL  atao  muat  decida  what  the  purpoaa  of  an  imperative  achaduling  requeat 
(repreaanted  by  acheduta201)  ia  relative  to  ita  databaae  collection  of  actiona.  Each  of  these 
modulea  will  now  be  diacusaed  in  detail. 


9.  Snterpratation  of  Oiacouraa  Purpoaea 


To  interpret  discourse  purposes,  a discourse  module  creates  a model  of  the 
discourse  and  controls  the  process  of  focus  identification.  Since  the  beginning,  middle  and  ertd 
of  a discourse  each  require  different  actiona  by  the  PAL  scheduler,  the  discourse  component 
models  each  differently.  The  first  sentence  of  the  diacourae  ia  assumed  to  specify  what  the 
nature  of  the  user's  communication  ia.  TNs  is  a simplified  view  of  the  real  communication 
process.  Many  discourses  do  not  simply  slate  their  object  and  then  elaborate  the  relevant 
information.  Inataad  many  speaKars  begin  a diacouraa  as  in  06  below  in  which  the  first 
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••nt«nc*  contain*  a raaaon  for  tom*  olh*r  action,  which  ia  r*qu*Bt*d  in  a later  a*nt*rK*. 
Other  ditcourae*  may  introduce  individual*  or  object*  to  the  hearer  for  later  comment  on 
them. 

06;  I am  going  on  vacation  the  beginnirtg  of  next  week.  John  want*  to  ***  me,  *o 
schedule  our  regular  meeting  aaaalon  before  I leave. 

The  currant  version  of  PAL  use*  the  •impllfied  view  of  discourse  to  choose  a 
discourse  purpose.  Introductory  sentences  are  assumed  to  be  making  some  tort  of  request. 
The  PAL  discourse  module  chooses  which  request  on  the  basis  of  the  verb  and  any  associated 
modals,  or  on  the  basis  of  verbs  of  desire  {w»nt,  ¥ri$h,  wotMd  Hk»)  and  the  verb  complement. 

A request  consist*  r>ot  only  of  the  request  type,  but  of  tom*  object  wNch  the  request  is 
about  (intransitive  verbs  are  not  relevant  to  PAL  since  telling  PAL  to  laugh,  run  or  groan  it 
inappropriate).  The  focus  of  the  discourse  is  used  for  this  purpose.  This  choice  Is  plausible 
not  only  because  the  focus  Is  closely  associated  with  the  object  of  the  verb,  but  alto  because 
a discourse  centers  discussion  on  some  particular  entity,  and  that  entity  Is  captured  by  the 
focus. 

Once  a focus  has  been  designated,  sentences  occurring  In  mid*dl*cours*  are 
assumed  to  be  about  the  focus  until  the  co*re(*r*nca  module  predicts  a focus-shltt  and/or 
until  the  verbs  used  are  inconsistent  with  the  discourse  request.  Mid>dlscours*  sentences 
often  do  not  explicitly  co-rafer  to  the  focus  as  has  bean  shown  previously  ip  Di  and  D4t  they  I 
may  contain  an  Implicit  focus  co-reference.  Us*  of  focus  for  co-reference  disambiguation  ha* 
the  added  benefit  that  sentences  containing  Implicit  focus  co-reference*  era  easily  recognized 
by  the  discourse  component.  Once  an  Implicit  focus  relation  I*  established,  the  module  can  go 
onto  predictions  of  focus  shift.  Knowledge  that  the  speaker  is  co-referring  to  the  focus, 
either  explicitly  or  implicitly,  makes  possible  the  prediction  that  the  discourse  I*  not  yet 
complete,  and  the  prediction  that  the  speaker  Is  making  a coherent  request.  SirK*  neither 
prediction  can  be  assumed  trivially  true,  the  focus  I*  important  to  the  communication  process. 

In  addition  to  the  focus,  the  discourse  module  contains  knowledge  allowing  the 
module  to  decide  If  the  verb  of  a new  senterK*  is  consistent  with  the  discourse  request.  Thus 
In  07  below,  the  second  sentence  uses  a verb  that  Is  consistent  with  the  scheduling  request 
while  In  07*,  the  verb  It  odd. 

07:  Henry  wants  to  meet  with  Harold.  Choose  a time  before  3 on  Thursday. 

07’t  Henry  wants  to  meet  with  Harold.  Sing  a song  before  3 on  Thursday. 

The  knowledge  needed  to  predict  consistency  Is  represented  In  the  frames  database  In  two 
way*.  First  the  fraik*  for  the  discourse  request  contain*  information  about  whet  other 
requests  can  be  sub-requests  of  the  discourse.  Second  a set  of  mapping  frames  contain 
information  which  determine  how  a verb  can  be  Interpreted  at  maklr>g  a certain  request.  For 
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'•xampla,  tha  vtrb  ba  can  ba  aasociatad  with  achaduling  and  ra-achaduUng  activitiaa.  Howavar, 
tha  inlantion  of  tha  apaaKar  in  a aantanca  lika  (2)  ia  diffarant  witNn  tha  contaxt  of  a 
achaduling  or  a ra-achaduling  raquaat. 

(2)  Tha  lima  ahould  ba  3 pm. 

In  a achaduling  contaxt,  (2)  can  ba  inlarpratad  to  raquaat  that  tha  tinta  ba  aatabliahad  aa  3 pm 
whila  (2)  in  ra-achaduling  can  hava  an  intarpratation  of  changing  tha  tima  from  whatavar  it 
was  to  3 pm.  PAL  capturaa  tha  inlantion  of  tha  apaaKar  ralativa  to  a raquaat  contaxt  by  an 
infararv'9  machaniam  which  ia  a matchar  that  dalarminaa  that  (2)  rapraaantad  aa  a frama^  can 
ba  aaac«iatad  with  achaduling  raquaata  by  a aimpla  mapping  batwaan  two  framaa.  TNa 
corraapondanca  couplad  with  tha  uaa  of  focua  maKaa  it  poaaibla  to  undaratand  (2)  aa  part  of  a 
diacouraa. 

In  addition,  tha  mapping  functiona  tall  how  to  intarprat  tha  currant  aantanca  into 
orm  of  tha  commanda  which  tha  achadular  can  parform.  Includad  in  tMa  procaaa  ara  how  to 
map  tha  alota  of  ona  frama  into  a frama  wNch  ia  tha  achaduling  Ktioa  For  axampla,  tha  varb 
frama  for  "Wa  can  maal  in  823*  is  mappad  from  a "maal*  frama  into  a frama  callad  "aaaarl" 
with  a alot  for  tha  objact  assartad,  which  ia  tha  focua,  and  a alol  for  what  ia  aaaarlad  about 
that  objact,  in  this  casa  tha  piKa  as  823. 

Tha  and  of  a diacoursa  ia  currantly  inlarpratad  aa  baing  tha  and  of  tha  apaaKar’a 
input  atraam.  A mora  aophiaticalad  maana  of  intarprating  diacouraa  and  ia  poaaibla,  though 
not  implemenlad,  given  tha  focus  mechanism:  whan  tha  needed  slots  of  tha  focua  are  fitlad, 
tha  speaKer  can  be  considered  to  have  finished  tNs  discourse.  Upon  sensing  tha  and  of  tha 
discourse,  tha  discourse  module  informs  the  achadular  that  it  can  carry  out  tha  action 
raquaalad  at  tha  discourse  beginning.  At  first  glarKO  tNa  may  appear  aa  if  tha  diacouraa 
request  specified  at  the  beginning  is  ignored  in  favor  of  other  raquaala.  In  fact  the  initial 
request  is  used  in  intarprating  mid-discourse  aanlencas.  However,  many  discourse  actions  like 
achaduling  require  that  tha  action  of  achaduling  ba  delayed  until  all  the  necessary  Information 
for  schaduiing  is  praaantad.  This  procaaa  normally  cannot  ba  stated  in  a single  aantanca,  and 
a whole  discourse  ia  rtaadad  to  fill  in  tha  request.  In  tMa  fasNon  tha  diacouraa  module 
reflects  the  fact  that  a discourse  consists  of  many  sub-discourses  cantered  around  individual 
aniilias  and  which  ara  opened  and  cioaad  by  focus  sMfting  or  finisNng  discussion  of  tha 
currant  focua. 

s 

PAL  ia  similar  to  tha  GUS  system  [Bobrow  at  al,  1977]  because  It  expects  a 
diacouraa  to  provide  information  about  tha  abla  of  a frama.  GUS  permits  user  irtiliativa 


5.  A frama  is  not  taken  aa  tha  meaning,  in  tha  classical  aamantic  aanaa,  for  <2)i  PAL  makes  no 
claims  about  this  aanaa  of  meaning. 


J 
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although  It  ia  unclaar  wfhat  tha  axtant  of  thia  initiativa  is.  GUS  does  not  seam  to  allow  for  user 
initiativa  of  tha  discoursa  raquasts.  Sinea  PAL  axpacts  full  usar  control  oyar  all  parts  of  tha 
discoursa,  PAL  naads  a complata  dascription  of  tha  discoursa  and  its  focus.  PAL’s  usa  of  focus 
also  peasants  a complata  thaory  of  tha  Kinds  of  co-rafaranca  problams  raisad  by  tha  GUS 
systam. 


6.  Co-rafaranca  Disambiguation 

Thara  ara  two  sub-modulas  for  co-rafaranca  Intarpratatlon  in  PAL,  tha  santantial 
and  intar-santantial  co-rafaranca  modutas.  Tha  intar-santantial  co-rafaranca  sub-modula 
chooses  co-referances  for  refarantial  terms  in  tha  discoursa  onca  tha  focus  is  identifad.  Tha 
task  of  datar mining  co-rafaranca  varies  depending  upon  tha  presence  or  absaiKa  of  previous 
discoursa.  Whan  there  is  previous  discoursa,  co-rafaranca  Interpretation  depends  largely  on 
tha  focus.  For  simple^  definite  noun  phrases,  PAL  assumes  either  the  focus  is  tha  direct 
co-rafarant  of  the  definite  noun  phrase  or  tha  focus  contains  a slot  that  is  the  co-rafaraiKO  of 
the  definite  noun  phrase.  This  assumption  needs  modification  since  soma  definite  noun 
phrases  are  used  to  refer  outside  the  context  of  the  discourse.  For  example,  whan  trying  to 
schedule  a meeting,  if  the  speaker  says  (3^  tha  definite  noun  phrase  co-rafars  to  an  entity 
.associated  with  tha  meeting  under  discussiont  that  association  is  reflected  in  tha  frame  slot 
structure  of  FPL 

(3)  Tha  bast  place  is  my  office. 

However,  if  tha  speaker  says  (4),  th0  conf0r»nc0  room,  i.a.'  that  particular  conference  room 
which  the  speaker  has  in  mind,  is  not  associated  with  maatirtgs  in  general,  and  so  tha  focus 
does  not  point  out  tha  co-rafarnnea. 

(4)  Wa  ought  to  meat  in  tha  conference  room. 

However,  by  searching  tha  focus,  tha  lack  of  a connection  can  be  noticed,  and  a raferanca 
from  tha  database  can  than  be  considered.  In  tNs  way,  tha  focus  acts  as  an  access  function, 
but  only  for  those  co-rafarantial  terms  related  to  tha  previous  santarKas  of  tha  discoursa. 

PAL  uses  database  search  with  growing  contexts  of  raferanca  to  choose  raferanca 
for  other  kir>ds  of  noun  phrases' which  refer  to  entities  outside  tha  discoursa.  Growing  a 
context  is  accomplished  using  tha  immediate  sat  of  frames  from  tha  first  sentaiKa  and 
recursively  creating  largar  sals  from  tha  slot  values  of  those  frames  until  tha  frame  with  tha 
name  in  question  is  found.  Tha  context  growing  mechanism  reduces  search  from  a more  global 
search  strategy,  and  helps  control  potential  ambiguities  that  exist  due  to  multipio  possible 

6.  A simple  definite  noun  phrase  is  a dafinita  noun  phrase  containing  no  ralativa  clauses.  At 
present  PAL  interprets  only  such  noun  phrases. 
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references  in  the  database.  This  same  method  could  be  used  for  definite  noun  phrases  that 
refer  outside  the  discourse. 


Use  of  the  focus  is  actually  somewhat  more  complex  since  the  definite  noun  phrase 
may  be  a co-reference  to  the  potential  focus  of  the  discourse.  Should  a definite  noun  phrase 
co-refer  to  the  potential  focus,  the  discourse  module  pushes  the  current  focus  to  a focus  stack 
and  takes  the  potential  focus  as  the  new  focus.  The  pushed  focus  is  available  for  later  use  In 
the  discourse.  The  current  inter-sentential  sub-module  does  not  interpret  definite  r>oun 
phrases  used  generically.  The  focus  can  be  used  for  these  cases  as  well  (see  Sidner, 
[forthcoming]),  but  the  details  of  this  process  are  not  included  In  the  current  version  of  PAL 

The  inter-sentential  co-reference  sub-module  also  determines  the  co-refererKe  of 
personal  pronouns.  For  the  pronouns  of  first  person  plural  (we,  us),  two  choices  can  ho  made. 
First  the  sub-module  can  choose  the  focus  as  the  direct  co-referent  of  the  anaphor.  Second 
the  sub -module  can  choose  a set  of  co-references  from  a particular  slot  of  the  focus.  That 
slot  must  contain  co-references  including  the  speaker  of  the  discourse.  For  he/she,  and  Its 
object  forms,  the  focus  is  chosen  as  a direct  co-reference.  Using  the  focus  as  co-referent 
explains  the  anaphoric  co-reference  In  08  of  his  to  Bruce  and  rather  than  Mike.  ' When  the 
focus  is  not  the  co-referent,  a co-referent  stipulated  by  the  co-reference  rules  of  the 
sentential  co-reference  sub-module,  discussed  below  Is  used.  Finally  If  neither  is  acceptable, 
entities  associated  with  the  focus  are  checked  for  co-reference.  This  sub-module  predicts 
misuse  of  he/she  prorwuns  if  no  co-references  ore  found  from  this  process  or  If  more  than 
one  results  from  the  last  step  in  the  process. 

The  interpretation  of  co-reference  for  he/shs  pronouns  needs  to  be  expanded  to 
irKlude  consideration  of  potential  focus  since  in  D8  below,  his  co-refers  to  Bruce  and  not  to 
Mike. 

08:  I want  to  have  a meeting  this  week.  Brucej  will  be  the  guest  lecturer.  Mike  wants 
to  read  hiS|  report  first. 

It  appears  that  the  focus  and  potential  focus  ought  to  be  checked  for  co-reference  to  such 
pronouns  before  sentential  co-reference  rules  are  used.  However,  further  experimentation 
with  such  cases  is  needed  to  confirm  this  aspect  of  co-reference. 


For  the  co-reference  of  it,  the  inter-sentential  co-reference  sub-module  chooses  a 
co-referent  either  from  the  focus,  the  potential  focus  or  from  predictions  from  sentential 
co-reference  rules,  which  are  discussed  below.  ITtls  choice  strategy  Is  fwt  entirely  adequate 
because  recency  appears  to  play  a rola  in  tha  co-raferance  choices  for  it.  Recency  rules  are 
'discussed  In  Sidner  [forthcoming],  and  coulij  be  included  in  a future  version  of  PAL.  The 
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inter -sentential  co-reference  sub-module  uses  the  semantic  constraints  placed  on  the  pronoun 
by  the  verb  in  a few  instancesi  this  portion  of  PAL  could  be  expanded  greetly.  Co-reference 
rules  for  they  work  similarly  to  those  for  it  with  consideration  that  the  speaker  cannot  be 
included  in  the  co-reference  set. 

When  no  previous  discourse  exists,  PAL’s  sentential  co-reference  sub-module  uses 
the  co-reference  rules  of  Lasnik  [1976]  to  choose  co-referertces.  The  rule  is  stated  as 
follows:  If  a noun  phrase,  NPj,  precedes  another  noun  phrase,  NP21  NP2  is  not  a pronoun, 
and  further  if  the  minimal  cyclic  node  dominating  NPj  also  dominates  NP2,  then  NP2  •»’<*  *^2 
are  disjoint  In  reference.  The  expression  "disjoint  In  reference"  is  taken  to  mean  have  rto 
references  in  common,  thereby  blocking  the  co-reference  of  Bob  and  Tom  to  they  In  <5). 

(5)  They  assume  that  Bob  will  talk  to  Tom. 

By  using  Lasnik’s  rule,  disjoint  references  of  a noun  phrase  In  a sentence  can  be  chosen,  as 
well  as  a list  of  acceptable  co-references  for  the  noon  phrase.  This  information  is  recorded  In 
the  frame  presenting  the  noun  phrase.  As  pointed  out  by  Reinhart  [1976],  Lasnik’s  rule  fails 
to  predict  the  disjoint  references  in  sentences  like  (6)  and  (7)  below,  but  these  cases  ere  not 
problematic  given  inter-sentential  co-reference  rules  because  other  rules  will  predict  the 
co-reference  for  the  pronouns  first. 

(6)  Near  Dan,  he  saw  a snake. 

(7)  For  Ben’s  wife,  he  would  give  hit  life.  j|j 

. In  addition  to  the  use  of  a co-reference  rule,  the  sentential  sub-module  determines  f | 

the  referents  of  proper  names.  Using  the  collection  of  frames  which  make  up  the  discourse,  e ij 

frame  containing  the  correct  first  (and  if  given,  last)  name  can  be  found.  Should  the  immediate 
discourse  fail  to  produce  the  name  referent,  a larger  context  can  be  grown  from  the  slot 
values  and  from  the  slot  defaults  of  the  frame  representing  the  focus.  The  same  context 
growing  mechanism  used  for  definite  noun  phrases  is  used.  By  this  process  of  context 

growing,  ambiguous  uses  of  names  like  John  can  be  avoided.  John  will  refer  to  that  person  ; 

most  closely  associated  with  the  discourse.  If  more  than  one  frame  for  the  name  John  is 

I : ' 

found,  the  context  growing  process  predicts  that  the  speaker  has  used  the  name  embiguously. 

Context  growing  has  .been  effective  in  a limited  number  of  cases  tested  so  far,  although  a 
database  with  more  potential  ambiguities  would  further  test  this  sub-module.  ■ 
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7.  Exlantlont 

Tha  currant  PAL  can  ba  axpandad  In  many  directiona.  Soma  of  tha  nacassary 
davalopmants  of  its  co-raferanca  capabilitias  twva  alraady  baan  diacustad.  Sigrtificantly, 
thasa  capabilitias  do  not  raquire  axtanslva  now  thaoraticsl  appparatuai  tha  focus  of  discourse 
artd  structura  of  FRL  can  sustain  tha  naadad  improvamants.  In  discourse  intarpratation  PAL 
must  ba  extended  to  interpret  discourses  wNch  define  new  people,  places,  events,  actions  artd 
like  objects  as  wall  as  to  interpret  prefarancas  of  users  and  purposes  for  Various  activities. 
These  extensions  r»ot  only  will  make  PAL  a more  useful  system,  but*  also  they  encompass  a sat 
*of  tasks  useful  for  other  interactive  programming  domains.  Experimentation  on  tha  discourse 
noodula  of  PAL  is  need  to  incorporate  thasa  new  capabilitias. 
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